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Solution Team 


AMD! 


ABOUT THE DISPLAY SOLUTION TEAM АМОД 
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ABOUT DAL 


A BRIEF HISTORY 


DAL (Display Abstraction Layer) Team 


4 1998-2008: ΡΑΙ 1 
DVI, VGA, LVDS 
A Ccode architected for Windows 98 
2008-2015: DAL 2 
A DP1.1, HDMI® 1.x, HDCP 1.x 


4 С++ code architected for Windows® Vista. Leveraged into AMD Catalyst™ 
closed source driver in user mode 


A Later new concepts retrofitted into architecture 
4 HW layer composition 
Dynamic refresh rate (FreeSync) 


4 DP1.2MST 
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АМОД 


DAL name is somewhat historic 
We're trying to get away from it to 
Avoid confusion and 
Set right direction 
Maybe just “Display Core” 


ABOUT DAL 


Á 


2015+: DAL 3 
4 Eco-system, HW and OS evolving. 
4 New architecture is needed to comprehend the new concepts 


4 | DRM Atomic mode setting 

4 Dynamic refresh rate 

4 HW composition 

Management of shared HW resource 


4 Compression 


Wide Gamut Display and HDR support 
4 ОР 1.4, HDMI® 2, 4K@120, 5K, 8K, + more... 
4 Architecture allow code reuse of HW programming functions 


A Linux®, Android, Windows® 
A Emulation platforms, Diagnostic Test Suite 


A Lots of change to accommodate Linux? Kernel 
A С, less layer, gradually remove abstraction 


4 More to соте... 


Team focuses оп providing display end to end solutions on multiple platforms 
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АМОД 


ABOUT DAL AMD. 


... ON LINUX® 


® DAL2 in AMD Catalyst™ ΝΒ. DAL3 in AMD GPU-Pro 


4 User space driver | Ў | Kernel space driver 
A Closed-source C++ x 4 Open source C code 
4 Windows based architecture x | Native Linux® Kernel support 


retrofitted into AMD 


Catalyst™ for Linux? Enable early support for new ASIC with 


high quality 


8 | XDC2016 | SEPTEMBER 9, 2016 


ΒΕΥΟΝΡ CODE АМОД 


SOME OF WHAT OUR TEAM DOES 


| Proactively working with ecosystem and standard bodies 
A VESA, СТА 
A Display inter-operability 


A Compliance requirement 


4 Attend plug test and incorporated inter-operability fixes 


Inter-op and good user experience is 
beyond just code 
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DAL3 DESIGN АМОД 


Platform policies 


i ® 
OS specific handling Windows® Linux 
- cursor, surface, composition Display (amdgpu) HW 
- Mode reporting and setting Display diagnostic 
- Display Capability (EDID, timing) Manager 
- Virtual target (tiled display) Manager 
- Display Adjustment 
- Use DRM functionality where available or contribute = AMD 


RADEON 


GRAPHICS 


Display Core 


DCE 11 94 VESA 


DCE 11.2 


AMD. 


FREESYNC 


TECHNOLOGY 


DER [5) DisplayPort 


HDMI 


HIGH-DEFINITION MULTIMEDIA INTERFACE 
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DAL3 DESIGN — DISPLAY CORE 


Surface 


Visible 
address Τὰ rget 
Flip mode al 
(composition 


Scaling quality destination) 


Src/dst rects 


Size Cursor 
Pitch 


Tiling 


Colorimetry 
Gamma correction 


Virtual HW blocks 


Stream 


Timing param 
Color depth 
Pixel encoding 


Src/dst rect 
Dither 
Csc regamma 


gamut remap 


audio 
info packet 


DC interface structures 


Link Sink 


(output 
connector on 


GPU) DDC/Aux 


EDID 


I2c/Aux Override 


override 


DP MST Sink cap retrieval 


Link Training 
Dongle detect 
HDCP 


Link manager 
& detection 
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АМОД 


DAL3 DESIGN АМОД 


BENEFITS OF DC/DM DESIGN 


DC = Display Core 
4 Leverage hundreds of hours of testing coverage DM = Display Manager 


4 Across different display types 


A DP MST, HDMI 2.0, FreeSync 
A 4k,5k modes. 120Hz, 144Hz 
4 Multiple display combinations. Up to 6 displays 


á Daily coverages. Manual + automation 


Common code base benefits both Linux° and other OS 
drivers 
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DAL3 Walkthrough 


AMD! 


DAL3 HW CORE LAYER DESIGN АМОД 


Surface Stream 


enable 
address Τὰ rget кае 
Fli 
κα (composition Color depth 
destination) Pixel encoding 


Scaling quality Link Sink 


Src/dst rects Src/dst rect 


DC interface structures 


(output 
si | 
іге бої Dither connector on 
Pitch Csc regamma GPU) DDC/Aux 
Tiling gamut remap EDID 
| I2c/Aux Override 
Colorimetry audio override 
Gamma correction info packet 


DP MST Sink cap retrieval 
Link Training 


dc create stream for sink (sink) Dongle detect 
dc create target for streams (streams[]) іі 
dc_create_surface() 

dc_validate_resources ( set of (target + surfaces[]) ) 

dc commit targets (targets[]) 

dc commit surfaces to target(surfaces[], target) 


15 | XDC2016 | SEPTEMBER 9, 2016 


DM_ATOMIC_COMMIT АМОД 


if ('азупс) | 
ret = drm atomic helper prepare planes(dev, state); 
if (ret) 
return ret; 


) 


drm atomic helper swap state(dev, state); 
drm atomic helper update legacy modeset state(dev, state); 
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DM_ATOMIC_COMMIT 


/* update changed items */ 
TOF each erte in State (stave, ОЕ ‘state, 1) 
action = get dm commit action(new state); 
DM COMMIT ACTION SET: { 
new target = create target for sink( 

aconnector, 
&crtc-»state-»mode, 
dm state); 


(!new target) 


, 


(acrtc->target) 
remove target ladev, acrcec); 


new crtoslnew crtcs Count] = aorto? 
new ertes Gount++; 


acrto-»orargert = new target; 


acrtc->enabled = true; 
aocrtossnw mode crtc->state->mode; 
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DM_ATOMIC_COMMIT АМОД 


list for each entry(crto, &dev-»mode config.crtco list; head) 4 
stroet amdgpu crite *acrtc = бо amdgpu crtc(crtc); 


if (acrtc->target) { 
commit targetsloommit targets count] = àortc-»target; 


чевсошна Є targets Counc? 


/* DC is optimized not to do anything if 'targets' didn't change. */ 
ας commit сагдегвадатерас, commit Largers, Commit targets count); 
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DC_COMMIT_TARGETS АМОД 


if (false == targets changed(core ас, targets, target count)) 
return DC ОК; 
result = core dc-»res pool->funcs->validate with context ( 
core ασ, Ser, Tarde: Counc, context); 
pplib apply safe state (соге ас); 
if (!dcb->funcs->is accelerated mode (dcb)) 


core dcoc-»hwss.enable accelerated mode (соге ас); 
if (result == DC OK) { 
result = core dc-»hwss.apply ctx to hw(core dc, context); 


) 
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DCE110 VALIDATE WITH. CONTEXT 


(σσ validate surrace SES (SEL, Set Count)) 
DC FAL. SURFACE VALIDATE? 


CONEXE Pre. GLX.pool = do-esres pool; 
(120; 1 < set count; itt) | 
context->targets[i] = DC TARGET TO CORE(set[i].target); 
COTLOXL-PMLarget Counttt, 


result = resource map pool resources(dc, context); 


(result == DC OK) 
result = resource map clock resources (de, context); 
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DCE110 VALIDATE WITH. CONTEXT 


(resource validats attach, surraces ( 
ser, вер count, Ос->епттег: Context, Context) d 


DC ERROR["Farled to attach surface to Large! п"); 
DC FAIL ATTACH SURFACES; 


(result == DC OK) 
result = validate mapped resource(dc, context); 


(result == DC OK) 
result = resource build scaling params Tor context (dc, context); 


(result == DC OK) 


result = dcellO0 validate bandwidth(dc, context); 


result; 
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DC_COMMIT_TARGETS АМОД 


if (false == targets changed(core ас, targets, target count)) 
return DC ОК; 
result = core dc-»res pool->funcs->validate with context ( 
core ασ, Ser, Tarde: Counc, context); 
pplib apply safe state (соге ас); 
if (!dcb->funcs->is accelerated mode (dcb)) 


core dcoc-»hwss.enable accelerated mode (соге ас); 
if (result == DC OK) { 
result = core dc-»hwss.apply ctx to hw(core dc, context); 


) 


22 | XDC2016 | SEPTEMBER 9, 2016 


APPLY_CTX_TO_HW 


Dr- тож МАХ PIPES; 1++) 
(!pipe ctx->strean | | 
pipe need reprogram(pipe ctx old, pipe ctx)) 
reset single pipe hw сіх ( 
de, pipe ctx old, do-»current context): 


: 0; i < MAX PIPES; i++) ( 


(pipe ctx->stream == pipe ctx old->stream) { 
(pipe ctx old-»5clock source Те pipe ctx->clock source) 
desshwss,crte switch. ἜΘ СІК sret 
pipe CUR RC LOCKE source, 217% 


} 
dc->hwss.enable display power gating( 


do, i, аср, 
PIPE GATING CONTROL DISABLE ) 4 
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APPLY_CTX_TO_HW 


set safe displaymarks (&context->res ctx); 


(context-»bw results.dispelk khz 
> dc-»current context-»bw results.dispclk knhz) 


set display clock(context); 


(i = 0; i < МАХ PIPES; i++) Í 
(pipe ctx->stream == 
(pipe ctx-»sstream pipe ctx old->stream) 
status = apply single controller ctx to hw 
pipe ctx, 
context, 
dc); 
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APPLY SINGLE CONTROLLER CTX TO HW 


if (!pipe ctx old->stream) Í 
pipe ctx-»rg-»runcs-»set bläankipıpe Ctx-»tg, true); 


pipe ctx-»clock source-»runcs-»program pix: οκ 
pipe Ctx-»cloCK source, 
&pipe ctx->pix clk params, 
&pipe сїх=>р11 settings) ) 


pipe ctx-»tg-»runcs-»program timing ( 
Pipe CC LO, 
&stream->public.timing, 
true); 


pipe ctx->mi->funcs->allocate mem input ( 
pipe СЕК, 
stream->public.timing.h total, 
SLtream-»public,.timring.v total, 
stream-»public.timing.pix сік КПа, 
context->target. count); 
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TG->FUNCS->SET_BLANK 


26 


set reg field value 
value, 
1 / 
CRTC BLANK CONTROL, 
СВТС ВІАМК DATA EN); 
dm write reg(tg->ctx, CRTC REG (mmCRIC BLANK CONTROL), value); 


for (counter = 0; counter < 100; counter++) { 
value = dm read reg(tg->ctx, CRTC КЕС (mmCRTC BLANK CONTROL)); 


if (get reg field value ( 
value, 
CRTC_BLANK CONTROL, 
CRTC BLANK DATA ЕМ) == 1 66 
get reg field value( 
value, 
CRTC BLANK CONTROL, 
CRTC CURRENT BLANK STATE) == 1) 
break; 


msleep(1); 


APPLY SINGLE CONTROLLER CTX TO HW 


(гаре еск oldesstream) 
pipe ctx-etog-sruncsesenable СОС 
pipe ctx-»tg)) 


DOS parser orte Source select (pipe: сбх)) 


pipe ctx->opp->funcs->opp set dyn expansion( 
pipe OUtX-2^ODD 
COLOR SPACE YCBCR601, 


Stream-»public,timing,.display color depth, 
pipe ctx->stream->signal); 


pipe ctx->opp->funcs->opp program fmt( 
pipe сїх=>орр, 
&stream->bit depth params, 
&stream->clamping) ; 


27 | XDC2016 | SEPTEMBER 9, 2016 


APPLY SINGLE CONTROLLER CTX TO HW 


stream-»sink-»1ink-»link enc-»funcs-»setup( 
Stream-»ssink-»limk-»1ink eno; 
pipe octx-»stream-»signal); 


(dc is dp signal (pipe ctx->stream->signal)) 
pipe otx-»stream enc-»runcs-»dp set Stream attribute(t 
pipe CCS EL Sal ene, 
&stream->public.timing); 
/* same for hdmi and dvi */ 


color space Бо black colori 

stream->public.output color space, &black color); 
pipe ctx-»tg-»runcs-»set blank color í 

pipe ctx->tg, 

&black color); 
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APPLY SINGLE CONTROLLER CTX TO HW 


(!pipe ctx old-»stream) | 
core Link enable Streampprpe CCX)? 


кас їв dp signal (pipe ctx-»streanm-»srgnal)) 
unblank stream(pipe ctx, 
&stream->sink->link->public.cur link settings); 


(Іріре ctx old || memcmp(&pipe ctx old-»scl data, 
&pipe ctx-»scl data, 
( scaler data)) 
program зсаїеєс(ас, pipe Ctx)? 


DC OK; 
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APPLY_CTX_TO_HW 


(і = 0; і < MAX PIPES; i++) 1 


(context-»res ctx.pipe ctx[i].audio != 


build. audio. output (pipe ctx, &audio Output); 
dal audio setup 
pipe ctx->audio, 
&audio output, 
єріре ctx->stream->public.audio info)) 
(!programmed audio dto) | 
dal audio setup audio wall dto( 
pipe ctx-»audio, 
pipe ctx->stream->signal, 
&audio output.crtc info, 
&andio output.pll info); 


programmed audio dto = true; 
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APPLY_CTX_TO_HW АМОД 


dc->hwss,set displaymarks(dc, context); 


switch ар clock sources (dc, &context-»res Ctx); 


DC OK; 
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DC_COMMIT_TARGETS АМОД 


program TIMING. вупс(|соге de, context); 
for (1 = 0; 1 < context-»target count; itt) | 
LE (context-»Ltarget status |i] „surface count > 9) 
target enable memory requests(dc target, 


Score do-»currenL Conlext=sres єрх); 


) 
pplib apply display requirements (core dc, 
context, Scöntext->pp display ста); 
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DM_ATOMIC_COMMIT 


/* update planes when needed */ 
ror each plane in State (state, plane, old plane state; 1) 4 
if (!fb || ferte || Tferto-»state-»planes changed || 
Icrtc->state->active) 
continue; 
action = get dm commit action (crtc->state); 


if (!page flip needed(plane state, old plane state, true) || 
action == DM COMMIT ACTION DPMS ON || 
action == DM COMMIT ACTION SET) { 
wait while priip status(adev, acrtc, ρε το pending predicate); 


dm de surface commit(om-»dc, ccrtc); 


for (i = 0; i < new crtcs count; itt) | 
manage dm Interrüptstadev, асгсс, True)? 
dm crto cursor reset(&acrtc-2ba5e);j 


DM_ATOMIC_COMMIT 


/* Page flip if needed */ 

tor each plane іп State (state, plane, old plane state, 1) 1 
drm plane state "plane State = plans 32:5; 
drm cree Sorto = plane SLate-scrtc; 
amdopu crte ACETIC = to Aamdgpu CE CC косу; 
drm framebuffer *fb = plane state-»fb; 


(page flip needed(plane state, old plane state, false)) 1 
ret = апсари crio page Elıplerte, 
ED, 
crtc->state->event, 
acrto-»rlip flags); 
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Challenges & Takeaway 


AMD! 


CHALLENGES & TAKEAWAY АМОД 


CHALLENGES 


Delivering new feature / functionality with DRM dependency in Distro 
A e.g. Ubuntu uses kernel 4.4 


4 Requires us to work on feature a year or half ahead 


A What if spec isn't even clear at that point? 


4 Some example 
4 SK DisplayID parsing code is in Kernel 4.7 


4 НОМІ? 2.0 CEA extension parsing for 4K(060 modes 


A X We started patching MST early this year but we still don't have MST working in Ubuntu 
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CHALLENGES & TAKEAWAY АМОД 


TAKEAWAY 


A DAL3 Architecture 
4 Designed with Linux® in mind 
A Ready for future technologies we anticipate from the industry 


4 Committed to Linux® 
4 Willing to work together 
4 Eager to share our experience in the industry with the community 


37 | XDC 2016 | SEPTEMBER 9, 2016 


16 | SEPTEMBER 9, 20 


XDC 
> 


"A 


DISCLAIMER & ATTRIBUTION АМОД 


The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. 


The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new 
model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise 
this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. 


AMD MAKES ΝΟ REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS 
INFORMATION. 


AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR 
OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. 


ATTRIBUTION 
© 2016 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. Windows is a 


registered trademark of Microsoft Corporation. HDMI is a trademark of HDMI Licensing, LLC. Linux is a registered trademark of Linus Torvalds. Other names used herein are for identification purposes only and may be 
trademarks of their respective companies. Other names are for informational purposes only and may be trademarks of their respective owners. 
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